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We introduce a class of weighted graphs whose properties are meant to mimic the topological 
features of idiotypic networks, namely the interaction networks involving the B-core of the immune 
system. Each node is endowed with a bit-string representing the idiotypic specificity of the cor- 
responding B cell and a proper distance between any couple of bit-strings provides the coupling 
strength between the two nodes. We show that a biased distribution of the entries in bit-strings 
can yield fringes in the (weighted) degree distribution, small- worlds features, and scaling laws, in 
agreement with experimental findings. We also investigate the role of ageing, thought of as a pro- 
gressive increase in the degree of bias in bit-strings, and we show that it can possibly induce mild 
percolation phenomena, which are investigated too. 
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I. INTRODUCTION 

Network theories are becoming fundamental pillars of 
modern approaches to describe reality in all fields of sci- 
ence, ranging from quantitative sociology [3JJ4] to sys- 
temic biology [SHE]. As for the latter, immunology is one 
of the few fields of science where the importance of un- 
derlying networks of interactions were stressed directly 
by a few immunologists as early as half a century ago 
[9] [10]: The importance of refined models for the core 
of B-cell interactions is fundamental in order to prop- 
erly address a possible regulatory role of the idiotypic 
network in "systemic" outcomes of the immune system, 
such as self-nonself discrimination [TTHT31 [T5] . Further- 
more, the recent goals obtained by monoclonal antibody 
techniques in clinical therapies [16] encourage the devel- 
opment of a clear theoretical backbone to control, for 
instance, the preparation and the effects of antibodies 
functionally mirroring proper enzymes [17] or the prepa- 
ration of proper vaccinations against autoimmune mani- 
festations (e.g. SLA) [18]. 

All this information continues to fascinate physicists 
and mathematicians who early started to develop the- 
oretical models shaped to the increasing availability of 
read data (see e.g. p~4j[T9]) an d continue to update plau- 
sible frameworks for these B-cell interactions (see e.g. 
[2QH23] and references therein). 

This paper presents a detailed study of the topolog- 
ical properties, and consequent structural implications, 
of a model for the B-cell branch of adaptive immunity, 
previously developed in (24J [25]. Here, inspired by re- 
cent biological findings [26] , we introduce a tunable bias 
among antibodies at the epitopal (idiotypic) level, which 
aims to mimic the non purely random genesis of these 
proteins. The inclusion of this feature has crucial conse- 
quences for the topology structure such as the emergence 
of fringes (i.e., multimodularity) in the weighted degree 



distribution; moreover, while with further increase in the 
correlation, the network gets sparser and sparser (possi- 
bly reaching underpercolated regimes) [27], and proper 
scaling laws can be detected, all in agreement (at least 
qualitatively) with experimental results [12]. In partic- 
ular, fringes in degree distribution were observed in the 
first studies p~2j [28] of antibody networks in mice and a 
structural role of these fringes was speculated: antibodies 
corresponding to nodes with a higher (weighted) coordi- 
nation number would act as inhibitors to autoimmune 
errors (self-attaching), while those with a lower coordi- 
nation number would act as soldiers against pathogens. 
Moreover, a relation between the overall similarity (i.e., 
the degree of bias) among antibody structures and the 
average coordination number has also been extensively 
reported [27H30] , and it suggests the following specula- 
tion: the B-repertoire of an immune system of newborn 
mice is essentially random (so as to be highly adaptable 
to the outside reality), hence displaying a high connectiv- 
ity. As infections are encountered, the network special- 
izes its army, increasing antibody similarities [57] and, as 
a consequence, the network connectivity decreases, con- 
sistent with experimental findings in adult mice. 



In this paper, after a streamlined introduction on the 
B-core of the immune system (Sec. [ll|, we introduce a 
class of weighted graphs meant to describe idiotypic net- 
works (Sec. III). Then, such a model is investigated for 
its global topology (Sec. IV), including degree distribu- 
tion and clustering features, and as for the distribution of 
weights associated witth its links (Sec. V). Next the age- 
ing of the system by increasing the degree of bias among 
bit strings; this process induces a progressive dilution 
of the network and the resulting percolation is analyzed 
(Sec. VI). Finally , we discuss our results and possible 
perspectives (Sec. VII). 
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II. A GLANCE AT ADAPTIVE IMMUNITY 

Mammalian immune system is a complex ensemble of 
interacting cells and exchanging molecules, whose scope 
is to prevent the host body from infection and damage by 
pathogens, either external (e.g. bacteria, fungi, viruses) 
or internal (e.g. cancerous cells). To satisfy this task, 
specific soldiers, each with specific weapons, were devel- 
oped. The best known troop in this army is probably the 
family of B-cells, whose weapons are the antibodies (also 
known as immunoglobulin) they secrete. 

In a nutshell, B-cells are divided into clones; each clone 
is made of cells producing the same antibody, and its size 
(the number of identical cells within the clone) varies over 
several orders of magnitude as a function of external stim- 
ulation. Consequently, the ensemble of all clones mirrors 
the repertoire of all possible antibodies, which, in order 
to cover a wide range of responses, must be enormously 
variegate [e.g. in humans it is (9(10 10 )]. In the following, 
each antibody is thought of as a bit string of fixed length, 
whose entries represent the presence (1) or absence (0) 
of the corresponding idiotope/paratope, i.e., the specific 
module devoted to the interaction and recognition of an 
epitope, a piece of an antigen. The latter is, by defini- 
tion, any molecule able to trigger an immune response 
and can also be thought of as a bit-string of the same 
length [24]. 

Now, following the pioneering investigation by Bur- 
net [31-33 known as "clonal selection theory", (and as- 
suming for the sake of clarity a binary representation in 
terms of bitstrings of information for both antigens and 
antibodies [14] that is deepened later), when an antigen 
(e.g. 01101100) is introduced into the body, the B-cell 
expressing a high-matching antibody (e.g. 10010011), af- 
ter exposure to the antigen itself, starts the replicative 
process, increasing the size of the clone, and all its cells 
secrete antibodies in a large quantity. The latter, being 
soluble molecules, rapidly explore the body deleting the 
pathogen. After the removal, the excess lymphocytes un- 
dergo apoptosis, shrinking the size of the clone to normal 
values [58] . 

One step forward, following the idiotypic network rep- 
resentation by Jerne [10 , the antibody produced at a 
high concentration can itself be identified as a "stranger" 
in the host body (as in the past it never appeared macro- 
scopically, for otherwise, the pathogen could not affect 
the host already vaccinated) and trigger a response by 
another antibody with complementary entries to itself 
(the ant i- antibody, e.g., 10011100, namely, an external 
reproduction of the pathogen, but devoid of dangerous 
DNA/RNA), and so on, giving rise to the so-called "an- 
tibody cascade" commonly seen in experiments (see e.g., 
[34] [35] and references therein). When an antibody is 
considered an antigen, the portion of the combinatorial 
site which is recognized by another antibody is called the 
idiotope. 

Another step forward, approaching the Varela theory 
[TTJ [12] , is to allow the same antibody to be recognized 



by several clones, namely, the idea of "mirror of mir- 
ror" is enlarged into an affinity matrix, in such a way 
that an antibody represented by a string, say (0110110), 
is reactive not only with its mirror (1001001), but also 
with (1001000), even though to a smaller extent. This 
gives rise to a network whose nodes are B-cells and links 
connect clones secreting well-matching immunoglobulins. 
This network has been shown to posses a regulatory role 
emerging from its non trivial topology and coupling dis- 
tribution. Some of these features have already been evi- 
denced in the context of unbiased antibody distributions, 
e.g. low-dose tolerance [36], generation of memory cells 
[25] and a unifying framework where the Burnet, Jerne, 
and Varela theories synergistically co-operate [25j [36]. 
On the other hand, as we show here, some experimental 
evidences can be captured by biasing the bit strings rep- 
resenting the epitopal expression of antibodies. Hence, 
we address questions such as, What is the underlying 
topology, if any? How does the topology affect the re- 
sponses of the system? Does the network live above the 
percolation threshold, or is it made of several indepen- 
dent clusters? How do these quantities evolve with time? 
These are questions of crucial importance within a sys- 
temic approach to our understanding of immunological 
reality. 



It should be remarked that, given the huge amount 
of antibodies making up the repertoire, and given the 
high intrinsic component of randomness in their genesis 
at the genetic level [38] , a pure random, unbiased forma- 
tion of antibodies can be justified in a first approxima- 
tion. Nonetheless, this approach underlies some limits 
which are now briefly summarized. First, we notice that 
proteins (in a immunological context, antibodies) purely 
randomly generated would not fold (or at least not all) 
into stable and functional structures [37]. Moreover, dur- 
ing the ontogenesis, each new B cell generated (and con- 
sequently each new antibody) is tested against self: in 
the bone marrow, newborn lymphocytes able to interact 
strongly with self molecules are deleted so to avoid au- 
toimmune reactions [38]. As a consequence of this learn- 
ing process, the final repertoire is biased. A further bias 
is due to the continuous antigenic stimulation and con- 
sequent production and reshape of optimal antibodies. 
Therefore, antibody structures get biased both through 
their genomic origin and as a result of the several external 
stimuli (adaptation to evolution). The latter effect sug- 
gests that the "effective degree of bias" can be considered 
as a raw measure of aging in a mature system: as time 
elapses and new external antigens are experienced, the 
repertoire gets more and more specific and, accordingly, 
more and more biased; correspondingly the average con- 
nectivity is expected to decrease. In fact, for instance, in 
[29] , the authors show that the high connectivity of new- 
born antibodies is lost in adult mitogen-reactive B-cell 
repertoires. 
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III. THE MODEL 

In a healthy human body at rest, it is estimated that 
the total number of clones generated from a single B-cell 
(and therefore exhibiting the same idiotypicity) is about 
10 2 to 10 4 and that the total number of B-cells amounts 
to some 10 12 — 10 14 , hence, the distinct clones is about 
10 8 — 10 12 . As for antibodies, their number is about 10 18 
and the number of idiotopes/paratopes [59] from which 
they are built, determining the idiotypicity of a given 
antibody, is estimated to be relatively small, i.e., of the 
order of 10 2 - 10 3 . 

For our purposes, immunoglobulins (Igs) and antigens 
can be described in the same way, namely, by means 
of a binary string £ encoding the particular sequence 
of idiotopes/paratopes that characterize their structure. 
Therefore, by comparing the strings & and £j of two given 
agents, labeled i and j, respectively, one can determine 
their degree of affinity and state whether they are likely 
to recognize each other and, if so, how strong the inter- 
action is. 

Now, the specificity encoded by & will characterize not 
only antibodies but also the clone, labeled z, secreting 
that kind of antibodies. In particular, our system is made 
up of an ensemble of M x N lymphocytes, which can 
occur in two states: quiescent, i.e., secreting a low dose of 
Ig; and firing, i.e., secreting high doses of Ig. The status 
of a given lymphocyte is therefore described by the binary 
variable o\ = ±1, where i = 1,...,7V distinguishes the 
idiotipicity, while i = 1, M, distinguishes lymphocytes 
belonging to the same i th clone (in principle, M may 
depend on the clone considered): a\ = +1 corresponds 
to the firing state, while o\ = —1 corresponds to the 
quiescent state. 

In this way, if immunoglobulins ^ and £j are affine 
enough, the corresponding clones i and j can be thought 
to be in "interaction", because a firing state of clone 
z, yielding a high concentration of immunoglobulin 
will be detected by lymphocytes belonging to clone j, 
which will react properly. These facts can be envisaged 
by means of a network of mutually interacting lympho- 
cytes. 

Let us now introduce the main definitions of the 
model. Antibodies and antigens are modeled as bi- 
nary strings representing the possible expression of L id- 
iotopes/paratopes. The assumption that each antibody 
can be thought of as a string of the same length L is 
based on the observation that "all the gamma-globulins 
have structural characteristic surprisingly similar" [39] . 
Hence, the L idiotopes 

e = (i,o,o,...,o), 
e = (o,i,o,...,o), 

C L = (o,o,o,...,i), 

may act as a base in such a way that a generic antibody & 
can be decomposed as a linear combination ^ = X]^ 1 + 



A 2 £ 2 + ... + Af£ L , with Af G (0,1) accounting for the 
expression (1) of the /ith idiotope on the ith antibody or 
its absense (0). 

We introduce biased strings by assuming that each en- 
try \i of the i th string is extracted randomly according to 
the distribution 

p«r = u = ^,p(tf=o) = ^, a) 

with a G [—1, +1]. In this way, when a = 0, we recover 
the previous unbiased model [24] and, in general, the 
average similarity between a pair of strings can be tuned 
via a as (ff ~ (1 + a) 2 /4. As we will see, Eq. 
provides a basic way to bias the repertoire, which allows 
us to study the direct effects on the network performance; 
more refined models can of course be obtained. 

Given a couple of clones, say i and j, the /ith en- 
tries of the corresponding strings are said to be com- 
plementary, iff Therefore, the number of 
complementary entries Xij £ [0? ^] can be written as 

xa = ELiKra - + #(i - o f c ° urse > xa 

strongly depends on the correlation parameter a and, in 
turn, it directly affects the affinity between i and j. In 
fact, the non-covalent forces acting among antibodies de- 
pend on the geometry, on the charge distribution and 
on hydrophilic-hydrophobic effects which give rise to an 
attractive (repulsive) interaction for any complementary 
(non-complementary) match. In principle, once the pro- 
tein folding problem is solved [37] , the whole analysis 
of this kind of network could be extremely simplified by 
directly studying the VDJ genes and their reshuffling; 
however, as this bridge among micro and meso biological 
scenarios is lacking, we rely on "effective descriptions." 
In particular, we assume that each complementary /non- 
complementary entry yields an attractive/repulsive con- 
tribution [40] [41]; the ratio between their intensities is 
denoted by the positive parameter a. Hence, we intro- 
duce the measure for the affinity between & and £j, 

/<*,«,!,(&, = [otXij ~{L- Xij)], ( 2 ) 

which ranges from — L (when ^ = £j) to aL (when all 
entries are complementary, i.e. £f = 1 — \i = 1, L). 
Now, when the repulsive contribution prevails, that is, 
fa,a,L < 0, the two antibodies do not see each other 
and the coupling among the corresponding lymphocytes 
Jij(a, a, L) is set equal to 0; conversely, when f a ,a,L > 0, 
we take as J^- the exponential of the affinity. This choice 
is the simplest trial able to mimic a key-lock mechanism 
for a sharp pattern recognition. Thus, we have: 

Jij(a,Ot,L) = @(fa,a,L(&i£j\ a ))' 

■exp[/ 0>a>L (6,^|a)], (3) 

where Q(x) is the discrete Heaviside function returning 
1, if x > 0, and 0, if x < [60]. Indeed, the ex- 
pression in Eq. Q ensures that the coupling strength 
among lymphocytes spans several orders of magnitude 
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(Jij G [0, exp(aL)]), as expected from experimental re- 
sults p] . 

One could possibly introduce a proper normalization in 
order to fix an average value for the coupling strength, 
which in turn fixes a scale for the level of noise ruling 
the thermodynamics of the system. This procedure is 
allowed due to the fact that, as the system is a ferromag- 
net, the average coupling is positive definite and that, 
for a given size TV, couplings display an upper bound. 
Nonetheless, the normalization is somehow arbitrary and 
does not qualitatively affect the behavior of the system. 
As for our current aims, we can neglect this and take 
Eq. (§ as an effective definition for the coupling strength 
between node i and node j. 

We also stress that N and L are intrinsically connected 
to each other. This can be easily seen in the case where 
the match among antibodies had to be perfect for recip- 
rocal recognition; then, in order to reproduce all possible 
antibodies obtained by the L idiotopes/paratopes, the 
immune system would need N = 2 L lymphocytes. Here, 
having relaxed the hypothesis of perfect match, only a 
fraction of this quantity needs to be retained to manage 
the whole repertoire, and we can introduce the following 
scaling between the number of all possible idiotypically 
different lymphocytes and the effective size of the reper- 
toire: 

L = 7 logiV, (4) 

where 7 > 0. 

In order to quantify immune responses one can intro- 
duce the set of observables 

1 M 

L—l 

where i labels the clone, and 1 labels the lymphocyte 
within each clone. The thermodynamic of the system 
can then be described by means of an Hamiltonian, 

^ N,N N 

H = -— ^2 Jijmrrij - c^h^ra*, (6) 

i<j i 

where the first term on the right hand side represents the 
mutual interaction among lymphocytes, while the sec- 
ond term accounts for the interaction with an antigen k 
present at concentration c and whose coupling with the 
z-th clone is h\ (see [24[ [44] for further details). Inter- 
estingly, these two terms encode the Jerne and Burnet 
theories, respectively. 

IV. GLOBAL TOPOLOGY 

If we forget, for the moment, the weights of the cou- 
plings, the N different B clones, interacting pairwise, de- 
fine a graph Q = {V,r}, where V denotes the set of 
nodes and T the set of links. The cardinality of V is 




FIG. 1: (Color on line) Link probability Pn n k(pi, Pj] ck, L) as 
a function of pi and pj with L — 10 and a = 0.5 (left) or 
a = 1.1 (right). 

given by \V\ = N, that is the total amount of idiotyp- 
ically different clones. The topological properties of Q 
are completely determined by the adjacency matrix A 
defined as = 1 if ^ and A^ — if = 0. 
For instance, the degree of a node i (i.e. coordination 
number) is given by %i = ^Zj e y Mj- In the following, we 
provide the main definitions and formula to describe the 
topology of the emergent graph and later we will deepen 
its global features. 

First, let us introduce the probability that a string & 
displays p non null entries; this follows a binomial distri- 
bution 



P(p;a,L) = 




Correspondingly, the probability that two strings & and 
£j, displaying pi and pj non- null entries respectively, ex- 
hibit x complementary matches is 

^ Pi-Pj+X ^j j ^ Pj-Pi+X ^ j ^ Pi+Pj-X ^ j ^ _ Pi+Pj+X^j j ' 

Now, the link probability Pn n k(Pi, Pj] L) can be ob- 
tained by summing P(x; p%, pj) over the values of x 
compatible with (pi,pj) and such that f a ,L((,i,(,j) > 0, 
namely, recalling Eq. (|8|, 

min(pi+pj , 2L — pi — pj) 

F\mk(puPjia,L) = ^ P (X]Pi,Pj)- (9) 

X=max(|pi-pj|, r^pil) 

Figure [I] shows an example of Pn n k(Pi, Pj]&,L) for differ- 
ent choices of the parameter a. 

By averaging Pi in k(pi, Pj] ol, L) over p i? we get the av- 
erage link probability for a node whose string displays pi 
non- null entries, that is, 

L 

P\ink(pi;ct,a,L) = ^2 P(pj] a, L)Pi ink (pi, pj; a, L), 

Pj =o 

(10) 
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FIG. 2: (Color on line) Link probability Pu n ^(pi; a, a, L) as 
a function of pi and a, while a and L are fixed. In the left 
panel a = 0.5 while in the right panel a = 1.1; in both cases 
L — 10. It is straightforward to see that on the line a — the 
probability link is independent on pi. When a / 0, the link 
probability changes with varying pi. Due to symmetry, only 
the range a G [0, 1] is shown. 



from which the average degree for j reads 

z(pn a, a, P, A/") = NPi ink (pi; a, a, P). (11) 

Numerical calculations of Pn n k(p; a, a, L) are shown in 
Fig. 2: Notice that a uniform bit distribution within the 
antibodies (i.e. a = 0) corresponds to an unbiased graph, 
where the average link probability of a node does not de- 
pend on the pertaining string. 

By further averaging Pn n k(Pi; ol, P) over p^ we get the 
average link probability for an arbitrary node of the sys- 
tem 

L,L 

Pi ink (a,a,P) = ^2 P\iTfc{puPj\OL,L) x 

xP( Pi ;a,L)P( Pj ;a,L), (12) 

from which the average coordination number follows as 

z(a, a, P, N) = NP link (a, a, P). (13) 

Finally, within a mean-field approach, we can use 
Piink(p; a, a, P) to write the degree distribution: 

P&e & ee{z\p\a,a,L,N) = z ^ [Piink(p;a,a,L)} z 
x [1 - P\ink(p;a,a,L)] N ~ 1 ~ z , (14) 
and, by further averaging over p, 



Pde g ree(>;a,a,L, TV) = ^2 P degree (z\p; a, a, L, N) 

x P(p;a,P). 

A. Multimodal degree distribution 



(15) 



As shown by Eq. ( 15 ), Pd eg ree(^; a, ol, P, N) is the sum 
of L binomial distributions, each referring to a different 
"mode" p. Therefore, the average degree distribution will 



show a multimodal behavior as long as two consecutive 
modes have disjoint supports. 

In general we expect that, when a is close to 0, peaks 
merge since, at low bias, the link probability weakly de- 
pends on p (see also Fig. IV), which is uniformity, while, 
when a is close to 1, most of the modes are off and the 
network is sparse. Therefore a very multimodal distribu- 
tion is expected only for intermediate values of a. Indeed, 
these arguments are corroborated by Fig. (|3| which shows 
different plots of Pd eg ree(^; a, ol, P, N) for different values 
of a; also notice that numerical data are finely fitted by 
the analytical calculation of Eq. ( fl5| ) . 

Consistent with the remarks in Sec. I, an increase in 
a basically encodes a set of ageing effects; in fact we can 
envisage that, in the antenatal period, when lymphocytes 
can not encounter antigens, a is close to 0, and the im- 
mune system is not specialized in any particular anti- 
gen response: There is no mult i- modal behavior in the 
degree distribution and the underlying B-cell network is 
roughly uncorrelated. Then, after the birth, lymphocytes 
start matching some antigens and undergo clonal expan- 
sion and, consequently, hypersomatic mutations: the sys- 
tem begins to specialize, and the connectivity starts to 
decrease. The degree distribution starts to exhibit a 
fine structure, where peaks correspond to specific sub- 
populations that have survived as a consequence of the 
antigen encountered in the past. 



B. Scaling in thermodynamics limit. 

As evidenced in the previous subsection, a crucially 
affects the topology of the idiotypic network. We there- 
fore investigate in more detail the global connectivity of 
the system in terms of Prink(&? P> ^0- To compute this 
quantity, one would plug Eq. ([sj) into Eq. ([9| and (12). 
However, since here we are interested in the large P limit 
and relative fluctuations in p decrease as 1/a/P, we can 
adopt a mean-field like approach and approximate Eq. 
as 

P(p;a,P) ~ P M f(p',cl,L) = S(p- p), (16) 
with p = (1 + a)P/2, so that Eq. ^ can be restated as 



MF 



(x;a,P) 



/(l+a)L/2\/(l-o)L/2\ 
I X/2 A x/2 ) 



( 



(17) 



(l+a)L/2J 



Moreover, exploiting the parity symmetry for a, we focus 
on the range a G [0,1]. Thus, Eq. ([9| can be approxi- 
mated as 



P\mk(a,a,L,N) 



—MF, _s 

P link (a, a, P, N) 



E 



(l-a)L 



\ y/2 A v/2 ) 



,(18) 



Now, since we are interested in scaling laws, we can ne- 
glect all terms in the sum, but the leading one. For in- 
stance, focusing on a < 1/2 (a > —1/2) and a > 1, it 
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FIG. 3: (Color on line) Degree distribution for different values of a and fixed a = 1.2, N = 10000, 7 = 10. From left to right: 
a = 0.1 (uni-modal behavior; the mean degree is about 0.8iV), a = 0.3 (multimodal behavior; the mean degree is 0.5 A/"), and 
a = 0.6 (there is no extensive network; the average degree is 0.03iV). 



is easy to see that this is given by x = L/2. Then, via 
Stirling approximation, we get 



log PimkO, a, L, N) ~ f(a)L, 



(19) 



where f(a) = (1 + a) log(l + a) + (1 - a) log(l - a) - (1 + 
2a) /4 log(l + 2a) - (1 - 2a) /4 log(l - 2a) is a symmetrical 
monotonically decreasing function from a = to a = 
±1/2 and roughly plays the role of an (compressed in 
the interval [—1/2, +1/2]) entropy of the bit-strings, such 
that, as in other biased approaches [42 , a can be thought 
of as the "bit-string magnetization". 
Therefore, we can write 



P link (a,a,L,N) ~ N^ a l 



(20) 



In this way, as the degree of bias a is increased, the graph 
turns from highly connected (Pn n k = 0(1)) to diluted 
CPiink ~ N~ 7C , with c = log(27/32)/2). Similar results 
can be found for a different parameter range. 



C. Clustering Coefficient 

The local clustering coefficient is defined as the number 
of triangles stemming from a node i over the maximum 
number of triplets centered on i itself (see, e.g., Refs. 
[3] |45j [46]). Due to the "anti-transitive" nature of the 
idyotipic network, triangles, i.e., 3-cycles, are expected 
to be unlikely, while quadrilaterals, i.e. 4-cycles, are ex- 
pected to be favored. Roughly speaking, an antibody 
Abl elicits its anti- antibody Ab2, which in turn elicits 
the anti-anti-antibody Ab3, whose structure should be 
close to Abl's, so that a 4-cycle finally develops. 

Indeed, this was partially shown in Ref. [24], where 
an idiotypic network and an analogous Erdos-Reny (ER) 



graph, namely, a purely random graph exhibiting the 
same average degree z were compared, obtaining that the 
former displays on average a significantly smaller num- 
ber of triangles. We now extend those results considering 
also 4-cycles (devoid of diagonals). Given a node i with Z{ 
nearest neighbors, the number of expected squares stem- 
ming from z, in the case where i belongs to a random ER 
graph and in the case where it belongs to our idiotypic 
network, are respectively 



{ER 



(N-l- Zi )p 2 (l-p), 



(21) 



Q( Zi )=^j(N-l-z i )p' 2 (l- P "), (22) 

where p = z/N, p f is the probability that in Q a neighbor 
of i is linked to a node not belonging to the ith neigh- 
borhood, and p" is the probability that two neighbors 
are linked. The latter is just the clustering coefficient 
for node z, which, as shown in Ref. [24], is smaller for 
a graph where links are based on complementarity fea- 
tures, so that (1—p) < (1 —p"). Moreover, the idiotypic 
and the ER graphs we are comparing display, by con- 
struction, the same coordination number. If we impose 
this condition to be true also for the average degree of 
any site j that is linked with i, we get 



= l+p'{zi-l)+p\N ■ 



2) (23) 



Therefore, as p > p h \ we get p < p' and finally Q(zi) > 
QER{zi). More generally, this suggests that in our idio- 
typic networks, as links are based on complementarity, 
4-cycles are motifs while 3-cycles are anti- motifs [24j [47] . 
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FIG. 4: Number of triangles (A) and of quadrilaterals (□) 
averaged over 100 realizations in our idiotypic network as 
a function of a. Parameters characterizing the network are 
N — 1000, 7 = 7 and a — 0.7. Curves represent the number 
of triangles (dashed line) and of squares (solid line) expected 
for an analogous ER graph. Inset: Number of isolated nodes 
present in the system; again, the idiotypic network (•) and 
ER graph (line) are compared. The latter displays a larger 
number of triangles as long as a giant component can be de- 
tected. 



Indeed, Fig. [4] numerically confirms that the number of 
quadrilaterals [triangles] appearing in our graph is larger 
[smaller] than the number expected for an analogous ER 



graph, estimated as ( 4 )p 4 (l — p) 2 , [( 



(N\ 



V. COUPLING AND WEIGHTED DEGREE 
DISTRIBUTIONS 



In Sec. [IVjwe studied the bare topology of the network, 
while here we focus on the properties related to the dis- 
tribution of weights Jij associated with links As 
mentioned above, these features retain a strong biological 
meaning. For instance, nodes displaying a high weighted 
degree feel, under normal conditions, a larger (internal) 
quiescent stimulus [36] . 



A. Coupling distribution 

Given two nodes i and j with pi and pj non-null en- 
tries, respectively, recalling Eqs. (2) and (3), the coupling 
distribution is 



P(Jij\pi,Pj',a,L) 



log(Jij)+L . 





Pi,Pj,L) if J k 
if Ja. 



> 1 
< 1 



,(24) 



where can span over [l,e aL ]. 

To obtain the mean coupling probability one should 
average Eq. (24), so P(x'<>Pi,Pj) of Eq. (JsJ) , over the 
binomial distribution of pi and Pj in Eq. 0. To give an 
analytical estimate of this quantity, as in section IV B[ 
one can use a mean- field like approach, namely Eq. (17). 




FIG. 5: (Color on-line) Average value of the coupling strength 
E[J] and of its variance Var[J], versus L for several choices of 
parameters, depicted in different colors. We considered a = 
0.7 (o), a = 1.0 (□) and a = 1.3 (A) for different values of 
a pertaining to a connected (upper panels) and disconnected 
(lower panels) regimes (see the legend). Symbols refer to data 
obtained from exact numerical calculations, while curves are 
drawn according to the approximation |27| 



Therefore, we can write the expressions: 

(l-a)L 



(X) MF = E xPmf(x) = ^^L, (25) 
x=o Z 

(l-a)L 

(J)mf= E P mf(x)J(x), (26) 

and, as long as (x)mf belongs to [a 2 , ^zj] we can ap- 
proximate 



(J)mf - j ((x)mf) = exp|[(a + l) 

= 7vt(«+ 1 ) i ^- 1 ]7 > 



I -a 2 



ML 



(27) 



Hence, (J) MF is expected to scale as a power of the sys- 
tem size and exponentially with a 2 . These results have 
been successfully checked by numerical simulations (see 
Figs. [5] and [6}. 



B. Weighted connectivity 



We now extend the bare degree Zi 



weighted degree referred to as Wi and defined as 



(28) 



This quantity has a strong thermodynamic meaning, in 
fact, recalling the Hamiltonian of Eq. (J6|, and assuming, 
for the sake of simplicity, the zero noise limit so that 
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FIG. 6: (Color on line) Average value of the coupling strength 
E[J] and of its variance Var[J], versus a for several choices 
of parameters, depicted in different colors. We considered 
a 0.7 (o), a 1.0 (□) and a 1.3 (A) and L 120 
(see the legend). Symbols refer to data obtained from exact 
numerical calculations, while curves are drawn according to 
the approximation (27). 



all lymphocytes in the neighborhood of i are quiescent, 



■T N J 



the local field acting on i is just (pi = — 2Jj=i J ij m j 
Wi. This remark allows to establish a correlation between 

i.e. 



weighted degree of node i and role of the z-th clone, 
either self or non-self addressed [24] . 

In general, W{ depends on the number of nearest- 
neighbors zi and on the coupling with each of them. So 
we define the weighted degree probability as follows: 



P(wi\pi,Zi;a,OL,L) = 

PiJM-^PiJiMSiwi-^Jij). (29) 

Jii----Ji Zi 3=1 

Now, averaging over all the possible numbers of 
nearest-neighbor, 

P(wi\pi] a, a,L) = ^2 p ( z i\Pu a, a, L)P(wi\piZi] a, a, L) 

(30) 

and averaging over nodes we obtain the mean degree dis- 
tribution 

P(w;a,a,L) = ^ P{wi\pi, a, a, L)P(pi\ a, L). (31) 



This is the theoretical description of the distributions in 
Fig. 7 which were obtained by numerical simulations of 
the network. The weighted degree distribution displa ys 
a fine structure similarly to Pdegree(^) (see Sec. (IV A)), 
and even a "hyperfine" structure. This is ultimately due 
to the fact that Eq. (29), being a sum of terms that can 



several multimodal distributions are superposed, giving 
rise to the complicated structure shown in Fig. 8. Oth- 
erwise stated, pi may univocally determine a range for 
the degree %i [leading to a multimodal Pdegree(^)], but 
Zi, in turn, does not univocally determine a range for Wi. 
The "reshuffling" between bare and weighted degrees can 
be seen in the scatter plots in Fig. 8 and it is mirrored 
by the non-trivial structure of P(w). As a result, nodes 
that are lazier in reacting to antigenic stimulation are 
not necessarily those with a large number of neighbors, 
but, rather, those with a large weighted degree; the two 
subsets cannot be trivially mapped into each other. 

It is worth emphasizing that, as the weighted degree is 
a sum of exponential factors, the support of P(w) covers 
several orders of magnitude, in both the connected and 
the disconnected regimes. This is consistent with the co- 
existence of agents highly (poorly) susceptible with re- 
spect to external stimuli, i.e. nodes with small (large) w. 
This difference has been attributed to the self-addressed 
or non-self-addressed attitude of lymphocytes [28j [30], 
and, interestingly, it also survives in the underpercolated 
regime. 

In particular, we can introduce the relations 



(w) ~ N(J), 
Var[w] ~ JVVar[J], 



(32) 



which hold as long as the couplings insisting on the same 
nodes can be approximated as independent. The expres- 
sions in Eq. (32) have been used to fit the numerical data 
in Fig. 

Finally, we stress that the support of P(w) remains 
spread over several order of magnitude also in the region 
of the parameter space where the network is underper- 
colated (see Sec. VI): This suggests that the existence 



of an extensive J erne-like network may not be strictly 
necessary for self /non- self discrimination in a systemic 
way. 



VI. BIAS AND SPECIFICITY 

As explained before, an increase in the bias parameter 
a corresponds to a progressive smoothing of the reper- 
toire variability. A growth in a also results in a dilution 
of the graph itself, eventually leading to percolation phe- 
nomena (see Fig. (11)). 

In order to study this process, we compare its features 
to those pertaining to an ER random graph Ger(N ,p), 
where links are drawn independently with probability 
p = z/N, in such a way that the average degree is the 
same for both networks. 



be localized in the w range, can, by itself, display a multi- 
modal distribution. When summing over pi in Eq. (31), 



Therefore, the evolution of Q{N,p) as p ranges from 
to 1, eventually leading to a percolation transition (see, 
e.g., [49, 50 ) is compared to the evolution of our graph 
Q (a, L, a) as a ranges from to 1. 
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FIG. 7: Semilogarithmic plot for the distribution P(w; a, a, L) obtained by averaging 100 systems made of N — 2 x 10 5 nodes 
with 7 = 7; results shown were averaged over 10 3 different realizations. We considered several values of a and a, corresponding 
to either over- or under- percolated regimes. 




2000 4000 5000 10000 15000 o 100 200 



FIG. 8: Each point shown corresponds to a node of the idiotypic network and its coordinates correspond to its weighted degree 
w and its bare degree z, respectively. The 100 system simulated is made up of N — 2 x 10 5 nodes and we fixed 7 = 7. We 
considered the same values of a and a used in Fig. 7. Notice that nodes displaying a different degree (and therefore a different 
p), may display the same weighted degree. The reshuffling among weighted and non-weigheted peaks is evident. 




FIG. 9: (Color on line) Average weighted degree E[w] and its 
variance Var[u>] versus a for a system of N = 10 4 sites, 7 = 7 
and a = 0.7 (o), a = 1.0 (□), and a = 1.3 (A). Symbols 
represent data from numerical sim ulations, while curves are 
the best fit obtained from Eq. (32). 



A. Giant component and distribution of cluster 
size 

In order to evaluate the impact of removing ties, we 
measure the relative size of the largest connected compo- 
nent S as a function of the fraction of links left / . 

In general, as a ranges from to 1, S gets continuously 
smaller: nodes with large p are those more likely to re- 
main isolated or to form small clusters. Differently from 
the ER case, beyond the giant component, clusters typi- 
cally display a small size. As evidenced in Ref. [45], these 
features give rise to a rather gentle percolation transition. 

In order to clarify this point, we focus on the evolution 
of the internal organization of clusters by measuring the 
distribution iV(a, s), representing the number of clusters 
of size s present when the correlation parameter is a. 



As shown in Fig. (10), as a grows, the typical clus- 
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FIG. 10: These plots describe the evolution of the network 
as a is tuned while we fix the system size N = 10000, 7 = 7, 
and a = 0.7 (solid line) and a — 1 (dashed line). Left panel: 
Average size of the giant component S/N. Right panels: rel- 
ative number of isolated nodes N(l,a)/N (upper panel) and 
average size of the non-giants components excluding isolated 
nodes (lower panel). 



ter size shrinks from N (a unique giant component) to 1 
(there are only isolated nodes). For instance, at a = 0.7, 
when a = 0.46 there are a few isolated nodes and a gi- 
ant component whose typical (normalized) size is around 
0.8. On the other hand, when a = 0.60 most nodes are 
isolated, S « 0.39 and the typical size of the remaining 
nodes is around 4.2. Beyond isolated nodes and the giant 
component, the statistics of cluster size is rather uniform, 
suggesting that minor disconnected clusters display small 
sizes, i.e., s < 10. 



B. Emergence of small components 

The difference between our idiotypic network and a 
random graph is emphasized by the analysis of the emer- 
gence of motifs around the the percolation threshold. We 
study the number of isolated k- loops, namely, unicycles 
of length fc, and isolated fc-stars, namely, subgraphs made 
of a central node connected to k nodes with unitary de- 
gree. In an ER graph these quantities follow 



(Nfc-star) =p k (l-p) 



(k+l)N-3k- 



(N kAoop )=p k (l-p) k ( N -V 



-i Uto(N-i + l) 
(k + l)\ 

Ulo(N-i + l) 
k\ 



> (33) 



(34) 



which are drawn for comparison in Fig. 12 In the 
idiotypic network the above-mentioned components are 
present in a smaller range of a, and the number of all 
stars, except the dimers (1-star), is eventually larger than 
ER predictions. This is due to the bias characterizing 
strings, as it makes the percolation transition smoother - 
in such a way that components not belonging to the giant 
component hardly develop - and induces inhomogeneity 
among nodes - in such a way that those associated with a 



value of p much lower than its average value are likely to 
act as (local) hubs. This picture is confirmed by the fact 
that /c-stars with large k have a higher mean coupling 
(see Fig. 12 , right). 

Moreover, in our simulations we never find isolated tri- 
angles (3- loop) or quadrangles (4-loop), which are instead 
present in random graphs. 



VII. CONCLUSIONS 

In this work we studied a class of weighted random 
networks aimed to describe the mutual interactions be- 
tween cells inside the B-branch of the immune system. 
As in previous works [25] [36] , the Burnet clonal expan- 
sion theory [3lJ [33] appears to be the standard one-body 
response of the system described by the Hamiltonian of 
Eq. ([6|, while the Jerne theory is ruled by the two-body 
term, whose coupling encodes the Varela prescription. 

The main novelty introduced is the existence of a bi- 
ased repertoire: The parameter tuning the "mean field 
similarity" among the epitopes (the entries in the bit- 
strings encoding for antibodies) is a scalar a ranging from 
(completely random) to 1 (completely deterministic). 
From a biological viewpoint, the extension to a value 
a is a step toard more realistic descriptions, since 
antibodies are not completely random objects [51] and 
since during ontogenesis in bone marrow, a part of the 
strongly self-reacting repertoire is killed [38] [52] . General 
findings [53] [54] and recent investigations (achieved in 
the study of the zebra fish repertoire [55 [26 ) show that 
the ensemble of the genetic heritage dedicated to anti- 
body formation is used in a highly inhomogeneous (Zipf- 
like) way, suggesting that some epitopes can be highly 
over-expressed compared to others. Although this would 
imply the use of an (with the price of a highly un- 
tractable mathematics), we skipped the fine dependence 
in favor of a simpler mean field choice — a ^ for 
all the epitopes [61]. This introduction of bias indeed 
gives rise to striking, at least qualitatively, effects: In a 
broad range of a ^ 0, the degree distribution displays 
a multi-modular structure which mirrors the history of 
the system itself. Such fringes in the (weighted) degree 
distribution were experimentally noted in the pioneering 
investigations by Varela and coworkers and an immuno- 
logical interpretation was provided in Refs. [28 , 30] [48]. 
Although based on synthetic data, our analysis extends 
these findings in some details, suggesting the existence 
of a hyperfine structure, richer than the one obtained by 
sampling over the experimentally available subset of the 
repertoire. Furthermore, the reshuffling among modes of 
the network raises the question of which should be the 
physical observable to characterize the reaction attitude 
of nodes: the bare degree (which is actually the standard 
one checked in experiments) or, rather, the weighted con- 
nectivitys which we claim to be the most relevant; We 
hope to report further investigation on this point soon. 
Here, we just stress that, while the average degree varies 
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FIG. 11: (Color on line) Realization of idiotypic networks made up of N = 10000 nodes with 7 = 3 and a = 0.7. Different values 
of a have been considered: from left to right a = 0.50, 0.55, 0.59, 0.62. Although these plots refer to one single realization, we 
have checked that the system displays robustness in this sense. 




a a 



FIG. 12: Left: Number of /c-stars (a dimer is a 1-star and a 
trimer is a 2-star) averaged over 100 numerical realizations 
of iV=10000 and a=0.7 idiotypic networks, while a (circles) 
is variated. Solid lines represent the ER prediction for the 
same quantity at the relative link probability. Right: Mean 
coupling value within a star. 



over a linear scale, the weighted degree displays a distri- 
bution with a log- normal like envelope, hence spanning 
several order of magnitude. This spread is robust as it 
holds also also for regimes of high dilution (corresponding 
to large a): Interestingly, this implies that mechanisms 
such as self/non-self discrimination may work despite a 
truely over-percolated network of B-cells. 

A progressive increase in a eventually leads to an 
under-percolated network, where nodes not belonging to 
the giant component typically form small-size clusters, 
which mirrors the cascades of anti-antibodies commonly 
seen in experiments (see e.g. [34 ). Further, the local 



topology displays squares as motifs and triangles as non- 
motifs, as expected from a complementary-based net- 
work, which is in agreement with experiments on idio- 
typic networks too. 

Moreover, we stress that the general picture in which 
the network starts with a high connectivity, then special- 
izing its responses lets the coordination number decrease 
(and a grows in our model) describes here a "thermody- 
namical growth process" as, once the network reaches the 
percolation threshold, its entropy remains positive even 
at a zero noise level: allowing a dynamics on a, that is 
slower with regard to the a, the evolution "naturally" 
shifts a to a nonzero value to maximize the entropy of 
the system [62] . 

In future investigations, several experimental-based re- 
finements and improvements are in order to achieve bet- 
ter predictive power. In particular, a change in the 
idiotope/paratope distributions toward mathematically 
challenging but biological more plausible choices [26] 
should be analyzed, and a clear relation among the ma- 
ture repertoire and the bias received during ontogenesis 
[56] should be determined too. 
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